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1 ^Detection of Protein Interactions 0 

3 Field of the Invention 

5 ,The present invention relates to a method of - 

6 detecting interactions between raacromolecules . In 

7 particular, but not exclusively, the invention < r 

8 relates to a method of detecting protein % i 
9' ' interactions using- fluorescence— — - ** 

10 

11 Background to the Invention 
12 

13 Protein to protein interactions play a key role in 

14 many biological processes including the assembly of 

15 enzymes, protein homo/he tero- oligomers, regulation 
IS of intracellular transport/ gene expression, 

17 receptor- ligand interactions, entry of pathogens 

18 into the cell and the action of small molecules or 

19 drugs . 
20 
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1 Identification and characterisation of 

2 macroraolecular interactions can be performed using 

3 co-immunoprecipitation from cell lysates and 

4 soiubilised merabranes. However, this technique 

5 requires specific antibodies for both capture and 

6 identification of proteins and may further require 

7 the use of detergent to disrupt interactions. 
B 

9 More recently non- invasive techniques have been 

10 developed to determine protein to protein 

11 interactions. 
12 

13 Such non~ invasive techniques were pioneered by the 

14 yeast two hybrid method which is based on 

15 complementation of a split yeast nuclear 

.16. . transcription .factor.. The yeast, two hybrid method 

17 utilises a mammalian bait protein connected to a 

18 yeast DNA-binding domain. This bait protein is used 

19 - to determine which prey proteins are able to bind to 
• • 20 the bait protein from a mixture of prey proteins, 

21 The mammalian prey protein is connected to a yeast 

22 transcription activation domain. When the mammalian 
23 -bait 'and prey proteins interact, the yeast DNA-" 

24 binding and transcription activating domains are 

25 brought together. The DNA binding domain can bind 

26 to the yeast DNA and the transcription activating 

27 domain is then suitably located to trigger the 

28 expression of a reporter gene encoding an enzyme 

29 which in turn can catalyse the production of a 

30 coloured product within the yeast cells thus 

31 indicating a successful interaction of bait with 

32 prey. 
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1 

2 The use of yeast expression systems to identify 

3 mammalian protein-to-protein interaction suffers 

4 from a number of flaws. Certain post-translational 

5 modifications, that are normally critical to 

6 mammalian protein interactions, cannot be achieved 

7 by yeast cells. For example, tyrosine 

8 phosphorylation is key to many mammalian 

9 intracellular protein binding events involved in 

10 signal transduction. However, the yeast genome 

11 contains no tyrosine kinase genes so 

12 phosphotyrosine- dependent protein interactions 

13 cannot be accessed in yeast two hybrid studies. * 
14 

15 Furthermore , in yeast two hybrid screening the '{ 

16 protein- complex must be able to translocate to the ^ 

17 nucleus to cause expression of the reporter gene or J 

18 cause downstream events to trigger the expression of - 

19 a reporter gene. Thus proteins that are excluded 

20 f rom' the yea~st nucleus will not be accessible to 

21 this screening method, 
22 

23 Further methods such as protein complementation and 

24 the split ubiguitin method utilise similar 

25 underlying concepts to the yeast two hybrid method 

26 in that the interaction of two proteins (a bait and 

27 prey protein) act to express a reporter protein, the 

28 reporter gene allowing the interaction event to be 

29 visualised as a detectable signal. 
30 

31 Such methods which utilise the expression of a 

32 reporter enzyme to produce a detectable signal 
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1 suffer from the disadvantage that the location of 

2 the protein complexes being detected cannot be 

3 accurately visualised in the cell - 

4 " 

5 Recently the technique of fluorescence energy 

6 transfer (FRET) has been used to determine protein 

7 - to protein interactions. In this technique the 
interaction of two fluorophores indicates their 

9 close spatial proximity. For protein to protein 

interaction monitoring the* addition of an absorbing 
moiety to one protein partner is complemented by the 
addition of a second fluorescing moiety to the 

13 second binding partner. Provided the emission 

14 spectrum of the absorbing moiety overlaps the 
excitation spectrum of the fluorescing moiety and 
both moieties are within 100A of each other FRET - 

17 will occur. Mutations in the sequence of green 

18 fluorescent protein (GFP) from the jellyfish 

19 Aeguorea victoria have been studied and shown to 

20 cause variations in the spectral emission of GFP 

21 giving rise to variants of GFP such as . Yellow 
Fluorescent Protein (YFP) , as well as cyan (CFP) and 
blue -<BFP) fluorescing variants. This technique uses- 
fluorescent energy transfer between these colour 
variants of GFP which are fused to interacting 
proteins to determine protein to protein 
interaction. Using this method, when the two GFP 

28 derived fluorophores are brought into close 

29 proximity, energy transfer between the fluorescent 

30 variants occurs and changes in fluorescence 

31 emissions are detected. Unfortunately, this method 

32 requires overexpression of the GFP fusion proteins 
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1 to allow quant ification of the small changes in 

2 fluorescence. Related methods to FRET require the 

3 use of irreversible photobleaching (FRAP) or 

4 expensive instruments capable of measuring 

5 fluorescence lifetime imaging (FLIM) „ 
6 

7 As a preliminary to the current experiments it was 

8 shown that green fluorescent protein can be 

9 engineered to acid amino acid residues at particular 

i- • . • * ■ - 

10 regions in the GFP sequence whilst fluorescence is 

11 retained. Further, it has been shown in Hn, CD, 

12 Chinenov, Y. and Kerppola, T. K. (2002). Mol „ Cell. 

13 9, 789-798 that using recombinant DNA technology 

14 specific Yellow Fluorescent Protein (YFP) fragments <> 

15 covalently fused to peptide sequences, which are ~ 

16 capable of interacting with each other can # 

17 reconstitute a fluorophore when the YFP fragments * 

18 are brought together, such that the peptide 

19 sequences could interact. 3 
" 2t) 

21 Further, it has recently been shown that 

22 fluorescence can be generated following the 

23 functional association of two separate fragments of' 

24 the GFP molecule (hapto-GFPs) when driven by the 

25 interaction of a pair of proteins fused via a linker 

26 to the new C and N' termini of the hapto-GFPs. 

27 " (Ghosh et al, (2000) ; "Hu et al, (2002). 
J28- 

29 However, the above methods suffer from the 

3 0 disadvantage that functional association of 

31 fluorescent fragments is limited by the constraints 

32 of stereochemistry imposed on the fragments by the 
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1 bait and prey proteins 1 association. If the fusion 

2 termini of the interacting partners are widely 

3 separated, productive association of the haptoGFPs 

4 will" not occur and no signal will be generated to 

5 indicate the interaction between the bait and prey 

6 peptides . 
7 

8 The present inventors have overcome a number of 

9 problems of the prior art * 
10 

11 Summary of the Invention 
12 

. 13 According to a first aspect of the present invention 

14 there is provided a protein interaction system said 

15 system comprising a first construct which encodes a 

. .... r 16 first- fragment of - .fluorescent -protein r a- first bait-- : - '-- 

17 peptide and a linker portion encoding at least 5 

18 amino acid residues interposed between- the first 

19 fragment and the bait peptide and a plurality of 

20 second constructs encoding a second fragment of 

21 fluorescent protein, a prey peptide and a linker 

22 portion encoding at least 5 amino acid residues 

- 23 interposed between the second fragment and the prey 

24 peptide and on interaction of the bait and a prey 

25 peptide the first and second fragments of the 

26 fluorescent protein complement each other such that 

27 functional association of the first and second 

< 

2 8 fragments promotes- fluorescence wherein at least 
29 ' two of the prey proteins have different amino acid 

3 0 sequences. 
31 
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Preferably all the prey proteins have different 
amino acid sequences ♦ 

Preferably the first and / or second construct 
comprises a linker portion which encodes between 15 
to 100 amino acid residues. 

Preferably the linker of the first and / or second 
construct is comprised of substantially hydrophillic 
amino- acid residues. 

More preferably the linker of the first and / or 
second construct is comprised of multiples of a 
pentapeptide sequence such as glycyl-glycyl-glycyl- 
glycyl - serine . 

More preferably the linker- of the first and / or 
second construct is greater than 20 amino acids, 
more preferably greater than 25 amino acids , more 
preferably greater than 30 amino acids, more 
preferably greater' than 35 amino acids, even more 
preferably greater than 4 0 amino acids , even more 
preferably greater than 50 amino acids and yet more 
preferably greater than 55 amino acids in length. 

Preferably,, the linker of the first and / or second 
construct encodes up to 60 amino acids. 

Where the peptides joined to the linkers are rod 
like structures and the peptides interact with each 
other with favourable topology of interaction, ie 
the peptides interact such that the fragments of 
fluorescent protein are brought into close proximity 
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1 with each other, short linker lengths are sufficient 

2 to allow screening for interaction partners. For 

3 example, short linkers could be used to screen a 
4 library of DNA binding proteins which from previous 

5 studies are known to be rod like in structure. 
6 

1 However, linker lengths between 15 to 100 amino 

8 acids are advantageous over shorter linker lengths 

9 as they allow bulkier peptides being tested for 

10 interaction to be conjoined to the first and second 

11 fragments of the fluorescent protein without the 

12 peptides being tested placing constraints on the 

13 functional association of the fluorescent proteins 

14 due to stereochemical hindrance. Such longer 

15 linkers are also advantageous to study small peptide 
- 16- pairs- that have- an- unfavourable topology of 

17 interaction such as is found in an anti -parallel 

18 complex (hapto-GFP-^-X? 1 .-binding to ;C 3 ->JI a -hapto- 

19 GFP) i.e. functional association of the interacting 

20 peptides causes the fluorescent 'fragments to be 

21 orientated such that they are directed away from 

22 each other in space. 

' • • 23' " ' 

24 Any fluorescent protein may be used in the 

25 invention. However, in a preferred embodiment the 

26 fragments of fluorescent protein are fragments of 

27 • green fluorescent protein, mutants or variants 

28 thereof. 
29 

30 More preferably the fluorescent protein is the 

31 humanised form of a fluorescent protein, e.g. 
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1 Enhanced Green Fluorescent Protein (EGFP) or a 

2 variant thereof . 
3 

4 Variants include peptides in which individual amino 

5 acids are substituted by other amino acids which are 

6 closely related as understood in the art, for 

7 example, substitution of one hydrophobic residue 

8 such as isoleucine, valine, leucine or methionine 

9 for another, or the substitution of one polar 

10- residue for another, such as arginine for lysine, 

11 glutamic for aspartic acid or glutamine for 

12 asparagine. . T; 

13 * 
.14 In a humanised nucleotide sequence one or more of 

15 the codons in the sequence are altered such that for $ 

16 the amino acid being encoded, the codon used is that - ■ 

17 which most frequently appears in humans. This is .i 

18 advantageous as the humanised fluorescent protein A 

19 construct e.g. (EGFP) has maximised expression 

2 0 levels and rate of flurophore formation in mamaliaii 

21 cells. This makes detection of fluorescence, 

22 produced by fragments of fluorescent proteins 

23 (fluorogenic fragments) which functionally associate 

24 with each other, easier to determine * 
25 

26 In a second aspect, there is provided a library of 

27 constructs encoding a fragment of "fluorescent 

28 protein, a peptide and a linker portion of at least 

29 5 amino acids interposed between said fragment and 

30 peptide -wherein said fragment of fluorescent protein 

31 is capable of functional association with a 

32 complementary fragment of fluorescent protein such 
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1 that on f tract ional association of said fragments 

2 fluorescence is enabled wherein the library of 

3 constructs encodes a plurality of different 

4 peptides . 
5 

6 Each member of the library encodes or provides a 

7 different peptide fused to a fragment of fluorescent 

8 protein via a linker. The peptides can be small 

9 peptides of differing amino acid .seguence, for.. . 

10 example nonomers , comprising different amino acid 

11 compositions or the same overall composition but 

12 with the amino acids present in a different order. 

13 Alternatively the peptides may be full size proteins 

14 obtained from a cDNA library. 
15 

16 -Preferably -the- constructs of the-library comprise "a " 

17 linker portion which encodes between 15 to 10 0 amino 

18 acid residues. 
19 

20 " Preferably the linker" is comprised of -substantially 

21 hydrophillic atnino-acid residues. 
22 

23 'More' preferably the linker " is comprised of multiples 

24 of a pentapeptide sequence such as glycyl-glycyl- 
2 5 glycy 1 -glycyl - serine . 

26 

27 More preferably the linker portion is greater than 

28 20 amino acids r more preferably greater than 25 

29 amino acids , more preferably greater than 30 amino 

30 acids , more preferably greater than 35 amino acids, 

31 even more preferably greater than 40 amino acids f 

32 even more preferably greater than 50 amino acids and 
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yet more preferably greater than 55 amino acids in 
length. 

Preferably , the linker encodes up to 60 amino acids. 

The invention further provides in a third aspect a 
library of polypeptides r each polypeptide comprising 
a fragment of fluorescent protein, a peptide and a 

*iy^ c 5F..£?^ „? mixl ° a.cid residues. 

interposed between the fragment and the peptide of 
the polypeptide. 

Unless the context demands otherwise , the term v * 
peptide, polypeptide and protein are used 4 
interchangeably to refer to amino acids in which the 
amino acid residues* are linked by covalent peptide ^ 
bonds or alternatively (where post-fcranslational 
processing has removed an internal segment) by 
covalent di -sulphide bonds, etc. The amino acid 
chains can be of any length and comprise at least 
two amino acids, they can include domains of 
proteins or full-length proteins. Unless otherwise 
stated the terms, peptide , polypeptide and protein 
also encompass various modified forms thereof, 
including but not limited to glycosylated forms, 
phosphorylated forms etc . 

Polypeptides may be made synthetically or 
recombinant ly using techniques which are widely 
available in the art. 
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1 In preferred embodiment s, the fragments of 

2 fluorescent protein (f luorogenic fragments) are 

3 generatable through the introduction of a split 

4 point between the amino acids at positions 157 and 

5 158, or {in a second embodiment) betweeji the amino 

6 acids at positions 172 and 173 of the humanised form 

7 of Green Fluorescent Protein {SSQ ID NO 1) . 
8 

9 SEQ ,ID NO 1 - EGPP (Clontech Inc^) [Genesbank 

10 Accession number gb;AAB02574 gi 1377912] : 

11 1 nrvskgeelft gwpilveld gdvnghkfsv sgegegdaty 

12 41 gkltlkf ict tgklpvpwpt Ivttltygvq cfsrypdhmk 

13 81 qhdffksamp egyvqertif fkddgnyktr aevkfegdtl 

14 121 vnrielkgid fkedgnilgh kleynynshn vyimadkqkn 

15 161 gikvnfkirh niedgsvqla dhyqqntpig dgpvllpdnh 
•16- 201 ylstqsalsk-dpnekrdhniv llefvtaagi tlgmdelyk" 
17 

18 The fluorogenic fragments generated by the 

19 introduction of a split point- between the amino acid 

20 ' residue's at positions 157 and 158 , or "between amino 
•21 acid residues at positions 172 and 173 , result in 

22 the production of hapto-EGFP 1 '^ 7 and hapto-EGFP 150/235 , 

23 ' or hapto-EGFP 37172 and hapto-EGFP 173/239 , respectively. 
24 

25 Alternative__split points are between residues 23/24 , 

26 38/39, 50/51. 76/77- 89/90. 102/103, 116/117, 

27 132/133, 142/143, 190/191, 21l/212 f 214/215 of EGFP- 
28 

29 Thus in preferred embodiments , the fragment 

3 0 comprises a fluorogenic fragment of amino acid 

31 residues 1 to 23, 1 to 38, 1 to 50, 1 to 76, 1 to 

32 89, 1 to 102, 1 to 116, 1 to 132, 1 to 142, 1 to 
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157 , 1 to 172, 1 to 190, 1 to 211, 1 to 214, 24 to 
239, 39 to 239, 51 to 239, 77 to 239, 90 to 239, 103 
to 239, 117 to 239, 133 to 239, 143 to 239, 158 to 

239, 173 to 239, 191 to 239, 212 to 239, or 215 to 

239 of EGFP. 

In one preferred embodiment a library of 
polypeptides according to a further aspect of the 

..invention, is provided wherein each member of the . 

library has a different peptide sequence fused to 

the fragment of fluorescent protein via the linker 

region . 

A bait peptide is a sequence of two or more amino 
acids, at least one domain of a protein or a full 
length protein* 

A prey peptide is a sequence of two or more amino 
acids, at least one domain of a protein or a full 
'length protein. * .... 

The term interaction or interacting as used herein 
means that two entitles, for example, distinct 
peptides, domains of proteins or complete proteins, 
exhibit sufficient physical affinity to each other 
so as to bring the two interacting entities 
physically close to each other. An extreme case of 
interaction is the. formation of. .a chemical bond that 
results in continual, stable proximity of the two 
entities. Interactions that are based solely on 
physical affinities, although usually more dynamic 
than chemically bonding interactions, can be equally 
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1 effective at co-localising independent entities. 

2 Physical affinities include, but are not limited to, 

3 for example electrical charge differences , 

4 hydrophobicity, hydrogen bonds, van der Waals force, 

5 ionic force, covalent linkages, and combinations 

6 thereof. The interacting entities may interact 

7 transiently or permanently. Interaction may be 

8 reversible or irreversible. In any event it is in 

. 9 .... contrast to and distinguishable from natural random - • — 

10 movement of two entities. Examples of interactions 

11 include specific interactions between antigen and 

12 antibody, ligand and receptor etc. 
13 

14 In a fourth aspect of the invention there is 

15 provided a protein interaction monitoring system, 

16 ' Scirid system comprising" a first polypeptide 

17 comprising a first fragment of fluorescent protein, 

18 a bait peptide and a linker portion of at least 5 

19 amino acid residues interposed between the first 
2 0 fragment and the bait peptide and a plurality of 

21 second polypeptides comprising a second fragment of 

22 fluorescent protein, a prey peptide and a linker 

23 portion of at least 5 amino acid residues interposed 

24 between the second fragment and the prey peptide and 

25 on interaction of the bait and a prey peptide the 

26 first and second Erz*~*w~** ^ of the fluorescent 

27 protein complement each other such that functional 

28 association of the first and. second fragments - - 

29 promotes fluorescence, wherein at least two of the 

30 prey proteins have different amino acid sequences. 
31 
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1 Preferably the linker portion of the first and / or 

2 second polypeptide comprises between 15 to 100 amino 

3 acid residues. 
4 

5 Preferably the linker of the first and / or second 

6 polypeptide is comprised of substantially 

7 hydrophillic amino -acid residues. 
8 

. 3 ~ . More preferably, the. linker, of the first, .and. A. or 

10 second polypeptide is comprised of multiples of a 

11 pentapeptide sequence such as glycyl-glycyl-glycyl- 

12 glycyl- serine, 
13 

14 More preferably the linker portion of the first and 

15 / or second polypeptide is greater than 2 0 amino .jr 

16 acids, more preferably greater than 25 amino acids, 

17 more preferably greater than 3 0 amino acids, more i 

18 preferably greater than 35 amino acids, even more i 

19 preferably greater than 40 amino acids, even more .z 

20 * preferably greater than 50 amino acids and yet more' 

21 preferably greater than 55 amino acids in length. 
22 

23 Preferably, the linker of the first and / or second 

24 polypeptide comprises up to 60 amino acids • 
25 

26 According to a fifth aspect of the present 

27 invention there is provided an assay method to 

28 determine peptide to peptide interactions comprising 

29 the steps of : 
30 

31 providing a first construct, said construct 

. 32 encoding a first fragment of fluorescent 
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1 protein, a first bait peptide and a linker 

2 portion of at least 5 amino acid residues 

3 interposed between the first fragment and the 

4 bait peptide ; 
5 

6 providing a plurality of second constructs said 

7 • constructs encoding a second complementary 

8 fragment of fluorescent protein, a prey peptide 

9_ ..- - >. -and. ..a . linker.. portion, of at . least .5 .amino acids ... . .. . . 

10 interposed between the second fragment and the 

11 • prey peptide wherein at least two constructs 

12 encode different prey proteins? 
13 

14 expressing both constructs in the same cell? 

15 and 

■ • "16. '" ~ 

17 detecting fluorescence produced in the cell. 

18 

19 Preferably all the second constructs encode 

20 different prey proteins. 
21 

22 Preferably the first and / or second construct 

23 comprises a linker portion which encodes between 15 

24 to 100 amino acid residues. 
25 

26 Preferably the first and / or second linker is 

27 comprised of substantially hydrophillic amino-acid 
..... .28.. .residues •• - • • 

29 

30 More preferably the first and / or second linker 

31 encodes multiples of a pentapeptide sequence such as 
3 2 glycyl -glycyl -glycyl -glycyl - serine . 
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1 

2 More preferably the linker of the first and / or 

3 second construct is greater than 20 amino acids, 

4 more preferably greater than 25 amino acids, more 

5 preferably greater than 3 0 amino acids, more 

6 preferably greater than 35 amino acids, even more 

7 preferably greater than 4 0 amino acids, even more 

8 preferably greater than 50 amino acids and yet more 
.9 preferably greater- than- 55 amino.. acids in length. 
10 

11 Preferably, the linker of the first and / or second 

12 construct encodes up to 60 amino acids. 
13 

14 In an embodiment of the assay the fluorescence 

15 detected may be quantitatively -determined such that 
1*6 fluorescence produced by different cells or under 
17 different conditions can be compared. 

18 

19 In one embodiment of the assay, the second construct- 

2 0 is provided as a member of a library of second * 

21 constructs wherein each member of the library 

22 encodes a different prey peptide wherein at least 

23 one second construct member of the library is • 

24 expressed in the same cell as the first construct 

25 encoding the bait protein. 
26 

27 The assay can therefore be used to screen an 

28 expression library to determine those peptides which 
2 9 bind to a bait peptide. 

30 

31 There is also provided an assay to determine peptide 

32 to peptide interactions comprising the steps of: 
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1 

2 providing a first polypeptide comprising a 

3 first fragment of fluorescent protein, a first 

4 bait peptide and a linker portion of at least 5 

5 amino acid residues interposed between the 

6 first fragment and .the bait peptide,- 
7 

8 providing a plurality of second polypeptides 
9. - s ..comprising, a .second fragment of . fluorescent . 

10 protein which, is complementary to the first 

H fragment of fluorescent protein, a prey peptide 

12 and a linker portion of at least 5 amino acids 

13 interposed between the second fragment and the 

14 prey peptide v/herein at least two second 

15 polypeptides encode different prey proteins; 
16 

17 mixing the first polypeptide and second 

18 polypeptide together; and 
19 

20 detecting whether fluorescence is produced. 
21 

2 2 Preferably the first and / or second polypeptide 

23 linker portion comprises between 15 to 100 amino 

24 acid residues. 
25 

26 Preferably first- ^rtd / or second polypeptide 

27 linker is comprised of substantially hydrophillic 
28.. . . amino-acid~res±dttes • ... 

29 

30 More preferably the first and / or second 

31 polypeptide linker is comprised of multiples of a 
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1 pentapeptide sequence such as glycyl-glycyl-glycyl- 

2 glycyl ^ serine . 
3 

4 More preferably the first and / or second 

5 polypeptide linker portion is greater than 2 0 amino 

6 acids, more preferably greater than 25 amino acids, 

7 more preferably greater than 30 amino acids , more 

8 preferably greater than 35 amino acids,, even more 

. preferably greater than .40 amino .acids,. even more • 

10 preferably greater than 50 amino acids and yet more 

11 preferably greater them 55 amino acids in length. 
12 

13 Preferably, the first and / or second polypeptide 

14 linlcer comprises up to 60 amino acids. 
15 

16 As detailed above the detected fluorescence can be 

• 17 quantitatively measured. 
IB 

19 In a particular example the assay method is 

20 ' performed in vitro. ' ... 
21 

22 The assay method may further, comprise the step of 

23 determining the location of the fluorescence in the 

24 cell. This is advantageous as it provides details 

25 of not only if a protein to protein interaction is 

26 occurring, but the location in the cell the 

27 interaction • is taking place, for example at the 

28 membrane, m in _ the ^cytoplasm, or. in the nucleus. 
29 

30 Xn addition, the assay method may further comprise 

31 the step of isolating the bait and / or prey peptide 

32 encoded from the cell in which fluorescence has 
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1 resulted, for example isolating a cell using a 

2 fluorescence activated cell sorting machine then 

3 isolating and sequencing the interacting peptides. 

4 The sequenced peptides can then be compared with 

5 sequences (full length or partial) in a data bank so 

6 as to identify or characterise the interacting 

7 peptide isolated from the cell. 
8 

„.-. vr $ -.. -The^-sequences. of .the .-jLnteracti-ng-.peptides -may ■ •_■ ■ • - 

10 alternatively be inferred by clonijag selected 

' ii • fluorescent cells and subjecting the cloned cells to 

12 PCR amplification and DNA sequencing. These 

13 sequences can then be cloned into expression vectors 

14 and the protein expressed and purified. The 

15 purified protein can be further studied or used for 

16 exaraplie in research. 
17 

18 In one embodiment, the assay method may further 

19 comprise the process of determining the subcellular 

20 dynamics of the peptide interactions visualised by 

21 fluorescence observations of living cells to enable 

22 spatio-temporal studies of peptide interactions 

23 throughout all parts of the cell cycle. 
24 

25 in a sixth aspect, which enables spatio-temporal 

26 studies, invent * «*> nrovides an assay which 

27 comprises the steps of providing a first construct 

28 encoding a polypeptide. :comprising a first fragment 

29 of fluorescent protein, a first bait peptide and a 

30 linker portion of at least 5 amino acid residues 

31 interposed between the first fluorogenic fragment 

32 and the first bait peptide; 
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1 

2 providing a second construct encoding a 

3 polypeptide comprising a second fragment. of 

4 fluorescent protein which is complementary to 

5 • said first fluorescent fragment , a second prey 

6 peptide and a linker portion interposed between 

7 the second f luorogenic fragment and the s&aandL 

8 P£^Y peptide; 

10 causing the expression of both constructs 

11 within the same living cell; and 

12 : 

13 and observing the level of fluorescence :i 

14 produced and its subcellular location in the 

15 cell at a range of time points following co- 

16 expression of both constructs", 
17 

18 Preferably the first and / or second construct ? 

19 comprises a linker portion which encodes between 15 
2 0" to 100* amino acid residues- 

21 

22 Preferably the linker of the first and / or second 

23 construct is comprised of substantially hydrophillic 

24 amino-acid residues . 
25 

26 More preferably the linker of the first and / or 

27 second construct is comprised of multiples of a 
.28 . pentapeptide. sequence .such as .glycyl^glycylrglycyl-. 

29 glycyl- serine* 
30 

31 More preferably the linker of the first and / or 

32 second construct is greater than 20 amino acids, 
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1 more preferably greater than 25 amino acids , more 

2 preferably greater than 30 amino acids, more 

3 . preferably greater than 35. amino acids,, even more 

4 preferably greater than 4 0 amino acids, even more 

5 preferably greater than 50 amino acids and yet more 

6 preferably greater than 55 amino acids in length . 
7 

8 Preferably, the linker of the first and / or second 
•9 . v • ». construct • encodes up.to - 60 amino -acids?' - *- - 
10 

11 In a seventh aspect, there is provided an assay for 

12 estimating the maximum possible separation of the 

13 fusion termini of the interacting peptides; 
14 

15 providing a first construct encoding a first 

16 fragment of fluorescent protein, a first bait 

17 peptide and a linker portion of at least 5 

18 amino- acid residues interposed between the 

19 first fragment and the bait peptide? 
2 0 ... _ 

21 providing a second construct encoding a second 

22 fragment of fluorescent protein which is 

23 complementary to said first fluorescent 

24 fragment, a prey peptide and a library of 

25 linkers of lengths ranging from 5 to 100 amino 

26 acids ; 
27 

2.8 r .~- expressing -both, constructs- in the same- cell 

29 following co-transf ection of a large population 

30 of cells with both constructs; 
31 
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measuring fluorescence produced in the cell, 
selection of those cells with higher 

fluorescence, using, .either a fluorescence 

activated cell sorting machine or alternatively 
by employing laser microdissection; and 

clonally amplifying these fluorescent cells, 
and sequencing the region of a large sample of 
• - . the constructs encoding the linkers and - ■ 
determining the length of the linkers . 

Preferably the linkers of the first and / or second 
construct are comprised of flexible pentapeptide 
sequences. 

Preferably the pentapeptide is comprised of 
substantially hydrophillic amino-acid residues. 
More preferably the pentapeptide is a sequence such 
as gly cy 1 -glycy 1 - glycy 1 - gly cy 1 - s erine . 

Preferably the number of peptapeptide sequences in 
the linker is determined by sequencing. 

Preferably the linker of the first and / or second 
construct length is between 10 to 100 amino acids, 
aih^rnati^-ly the linh^r can be between 15 to 100 
amino acids in length. In a yet further alternative 
the,, linker , can.. he. J20 to. ICQ . amino aoids : in length.. 
As a further alternative the linker can be 30 to 100 
amino acids in length. 
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1 A distribution of occurrence of linker lengths will 

2 be obtained' in the fluorescent cells selected, with 

. ... 3. . a sharp cutoff at. the lower limit reflecting the- - 

4 minimum linker length capable of spanning the 

5 separation of the fusion termini of the interacting 

6 peptides and thus allowing productive association of 

7 the fluorogenic fragments. A maximum value for this 

8 distance may be evaluated in Angstroms on the basis ■ 

♦ • - 9 -. . -that-each amino acid residue contributes 3 . 7A to tha^~ * 

10 length of each linker in an extended backbone 

11 conformation. 
12 

13 Further assay methods of the present invention may 

14 be used to detect the interactions of three or more 

15 agents in a trimeric or higher order complex. 
16 

17 In one example, three fluorescent fragments may 

18 provided by introducing two split points as 

IS discussed above into the fluorescent protein, each 

20- fragment being fused to a peptide. On interaction 

21 of the peptides the three or more fluorescent 

22 fragments are brought together such that they can 

23 functionally associate and generate a fluorescent 

24 signal capable of being detected. 
25 

26 In another w-uss^lc : "re cf the three 

27 fluorescent fragments can be fused to a test agent 

•« - -2& •*-.. .such-ae a small- molecule, --such -as a- metal ion-. . -- In- - . • 

29 this manner, protein interactions which require the 

30 participation of additional test agents, such as 

31 small molecules can be detected. 
32 
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1 Modulation of the interaction between peptides may 

2 be a desirable outcome in the treatment of certain 
3- clinical conditions, - or- as a research tool to study 

4 peptide to peptide interactions- For example, 

5 modulation of protein to protein interactions may 

6 facilitate the task of determining the steps of 

7 complex pathways by the provision of means to 

8 promote or inhibit a specific interaction, allowing 
'-9 V - -the- effects -of -other proteins to* be studied in ■ ■ -* 

10 better detail. 
11 

12 Many protein to protein interactions require the 

13 participation of small molecules or peptides. Such 

14 a requirement can be determined by simply adding 

15 small molecule ligands or the peptides to the 

16 components of the assay to determine if these 

17 modulate protein to protein interaction as measured 

18 by an alteration in fluorescent signal , 
19 

2 0 Thus in' an eighth aspect there is provided an assay 

21 for determining whether a candidate agent modulates 

22 protein to protein, interactions comprising the 

23 Steps : 
24 

25 providing a first construct encoding a first 

2£ fragment of fluorescent protein, a first bait 

27 peptide and a linker portion of at least 5 

28 amino acid residues interposed between the 
2 9 first fragment and the bait peptide,- 

30 

31 providing a second construct encoding a second 

32 fragment of fluorescent protein which is 
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1 complementary to said first fluorescent 

2 fragment, a prey peptide and a linker portion 
3. - .... . of at least 5 amino-acids interposed, between 
4 the second fragment and the prey peptide; 

5 

6 providing a putative modulating agent; 
7 

8 expressing both constructs in the same cell; 

• • z • ■''aiid - '" : ' *• ' ■ ■ '' * * '"' : ' 

10 

11 measuring fluorescence produced in the cell in 

12 che presence and absence of said putative 

13 modulating agent 
14 

15 wherein a reduction in fluorescence in the 

16 ' " presence of said modulating agent compared to 

17 fluorescence in the absence of said candidate 

18 modulating agent is indicative of inhibition of 

19 complex formation by the modulating agent and 

20 " an increase in fluorescence is indicative -of 

21 enhancement of complex formation by the 

o 

22 modulating agent. 
23 

24 Preferably the linker of the first and / or second 

25 construct comprises a linker portion which encodes 
between 15 to ICC J2=i= -cid residues. 
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Preferably the -linker- .of the-.f irst- and /. or second ..- 
construct is comprised of substantially hydrophillic 
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More preferably the linker of the first and / or 
second construct is' comprised of multiples of a 
pentapeptide sequence such as glycyl-glycyl-glycyl- 
glycyl- serine . 

More preferably the linker of the first and / or 
second construct is greater than 2 0 amino acids, 
more preferably greater than 25 amino acids , more 
.preferably.. greater, than 30. amino acids , more. ~. 
preferably greater than 35 amino acids , even more 
preferably greater than 4 0 amino acids r even more 
preferably greater than 5 0 amino acids and yet more 
preferably greater than 55 amino acids in length. 

Preferably , the linker of the first and / or second 
construct encodes up to 60 amino acids . 

In a ninth aspect there is provided an assay for 
determining whether a candidate agent modulates 
protein to protein interactions comprising the 
steps : 

providing a first polypeptide comprising 
a first fragment of fluorescent protein, a bait 
peptide and a linker portion of at least 5 
amino acid residues interposed between the 
first fragment and the bait peptide; 

providing a second polypeptide comprising a 
second fragment of fluorescent protein which is 
complementary to said first fluorescent 
fragment, a prey peptide and a linker portion 
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1 of at least 5 amino-acids interposed between 

2 the second fragment and the prey peptide; 

.3 . • • • - 

4 providing a putative modulating agent; and 

5 

6 measuring fluorescence produced in the presence 

7 and absence of said putative modulating agent 
8 

. 9 .. ....-...-wherein a reduction in fluorescence in. the • -. 

10 presence of said modulating agent compared to 

fluorescence in the absence of said candidate 
modulating agent is indicative of inhibition of 
complex formation by the modulating agent and 
an increase in fluorescence is indicative of 
enhancement of complex formation by the 
16 modulating agent. 

17 

Preferably the linker of the first and / or second 
polypeptide comprises between 15 to 100 amino acid 
20 residues: 
21 

22 Preferably the linker of the first and / or second 

23 polypeptide is comprised of substantially 

24 hydrophillic amino-acid residues. 



11 
12 
13 
14 
15 



IS 
19 



25 
26 



More preferably the linker of the first and / or 

27 second polypeptide is comprised of multiples of a 

2-8 • pentapepfcide sequence ,such-.as glycylTglycyl-glycyl^ 

2 9 glycyl - serine . 
30 

31 More preferably the linker of the first and / or 

32 second polypeptide is greater than 20 amino acids, 
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1 more preferably greater than 25 amino acids, more 

2 preferably greater than 3 0 amino acids, more 

3 preferably greater than 35 amino acids, even more 

4 preferably greater than 40 amino acids, even more 

5 preferably greater than 50 amino acids and yet more 

6 preferably greater than 55 amino acids in length. 
7 

8 Preferably, the linker of the first and / or second 

. . 3. v . construct polypeptide • is - up to 60 amino. acids. . , 

10 

11 Thus the above assay can be used to select compounds 

12 • capable of triggering, stabilising or destablising ; 

13 peptide to peptide interactions, 
14 

15 'As will be apparent , the assay of the present (, 

16 invention can be applied in a format appropriate for X 

17 large. scale screening, for example, combinatorial 

18 technologies can be employed to construct }■ 

19 combinatorial libraries of small molecules or % 
20* ' peptides to test as modulating agents. 

21 

- 22 Preferably, structural information on the peptide to 

23 peptide interaction to be modulated is obtained by 

24 testing different agents to determine if they are 

25 modulating agents . 
oc 

27 For example, each of the interacting pair can be 

- . • expressed and purified and then allowed, to interact . 

29 in suitable in vitro conditions • Optionally the 

30 interacting peptides can be stabilised by 

31 crosslinking or other techniques. The interacting 

32 complex can be studied using various biophysical 
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1 techniques such as X-ray crystallography, NMR, or 

2 mass spectrometry. In addition, information 

..3 . concerning the interaction can be derived through 

4 mutage ns ! experiments for example alanine scanning, 

5 or altering the charged amino acids or hydrophobic 

6 residues on the exposed surface of the bait or prey 

7 peptide being tested. 
8 

. 9 • Based on the structural information obtained-, - 

10 structural relationships between the interacting 

11 peptides slb well as between the modulating compound 

12 and the interacting peptides can be elucidated. 

13 Further, the three dimensional structure of the 

14 interacting moieties and / or that of the modulating 

15 compound can provide information to determine 
16' suitable lead compounds able to modulate 

17 interaction, which medicinal chemists can use to 

18 design analog compounds having similar moieties and 

19 structures. 
2 0 

21 In a tenth aspect, the invention provides compounds 

22 obtainable by an assay of the invention, for example 

23 small molecules, peptides or nucleic acids which 

24 interact with the peptides being tested and modulate 

25 the formation of a peptide complex. 
26 

27 Modulator compounds obtained accordingly to the 

28 -method of invention, may - be- .prepared as -a - - . . 

29 pharmaceutical preparation or composition . 

30 Such preparations will comprise the modulating 

31 compound and a suitable carrier, diluent or 

32 excipient* Tthese preparations may be administered 
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1 by a variety of routes, for example, oral, buccal, 

2 topical, intramuscular, intravenous, subcutaneous or 

3 the like.. 

4 . 

5 According to an eleventh aspect of the present 

6 invention there is provided a method of 

7 manufacturing a composition or preparation 

8 compris ing : 

10 performing an assay for determining whether a 

11 candidate agent modulates peptide to peptide 

12 interactions as described above; and 'i 

13 i 

14 formulating said agent into a composition. * 
15 

16 Also provided are nucleic acid constructs for use in ? 

17 the invention* 

18 • J.. 

19 Accordingly, in a twelfth aspect, there is provided 

20 a nucleic acid construct encoding- a fragment of a 

21 fluorescent protein, a peptide and a linker portion 

22 of at least 15 amino acid residues interposed 

23 between said fragment and said peptide, wherein said 

24 fragment of fluorescent protein is capable of 

25 functional association with a complementary fragment 
25 of fluorescent protein such that on functional 

27 association of said fragments fluorescence is 

-28 enabled.. ^. ~— 

29 

3 0 Preferably the first and / ox second construct 

31 comprises a linker portion which encodes between 15 

32 to 100 amino acid residues. 
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1 

2 Preferably the linker is comprised of substantially 

3 hydrophillic amino-acid residues. 
4 

5 More preferably the linker is comprised of multiples 

6 of a pentapeptide sequence such as glycyl-glycyl- 

7 glycyl-glycyl- serine . 
8 

.... 9.-.:. ..,Kore. preferably the .linker- port ion .encodes .greater. 

10 than 2 0 amino acids, more preferably greater than 25 

11 amino acids, more preferably greater than 30 amino 

12 acids, more preferably greater than 35 amino acids, 

13 even more preferably greater than 4 0 amino acids, 

14 even more preferably greater than 50 amino acids and 

15 yet more preferably greater than 55 amino acids in 

16 length. 
17 

IS According to a thirteenth aspect of the invention 

19 there is provided an expression vector comprising at 

20 - least one construct .encoding- a fragment of 3. 

21 fluorescent protein, a peptide and a linker portion 

22 of at least 15 amino acid residues interposed 

23 between said fragment and said peptide, wherein said 

24 fragment of fluorescent protein is capable of 

25 functional association with a complementary fragment 

26 of fluorescent rvr^***** 5 " a >idi that on functional 

27 association of said fragments fluorescence is 

28 .. enabled operably linked, to .at: least one regulatory 

29 sequence for the expression of tlie construct. 
30 

31 The vector can be introduced into the cell using any 

32 known techniques such as calcium phosphate 
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1 precipitation, lipofection, electroporatioa and the 

2 like. 
3 

4 Where two vectors are provided, and each vector 

5 encodes a different construct, for example a bait 

6 construct and a prey construct r the vectors can be 

7 transfected into the same cell or alternatively into 

8 two different cells which are subsequently fused 

9 ...together by r*<=n."L .fusion, or other : suitable . . 
10 techniques. 

11 

12 In a fourteenth aspect of the invention there is 

13 provided a cell transformed with a vector comprising 

14 at least one construct encoding a fragment of a 

15 fluorescent protein, a peptide and a linker portion 
i : 6 " of 'at least 15 amino acid residues interposed 

17 bet we en ssid fragment and said peptide., wherein said 

IB fragment of fluorescent protein is capable of 

19 functional association with a complementary fragment 

20 of fluorescent protein such that on functional 

21 association of said fragments fluorescence is 

22 enabled operably linked to at least one regulatory 

23 sequence fox the expression of the construct. 
24 

25 Cells which may be transformed include eukaryotic 

26 cells, such as yeast, insect , plant, mammalian, 

27 primate and human cells. Mammalian cells may be 

28 . primary cells or transformed cells, including tumour 

29 cells. The system is not restricted to intracellular 

30 (single cell) interactions. In multicellular 

31 organisms amenable to genetic manipulation, a 

32 protein-hapto-GFP construct could be released from 
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1 one cell or organ and be recognised by another 

2 protein- (receptor) -haptoGFP fusion to indicate 

3 localisation of filled receptors by the resultant 

4 fluorescent signal. 
5 

6 In cell free systems such additional proteins as 

7 required for expression may be included , for 

8 example, by being provided by expression from 

9 suitable recombinant expression vectors. 
10 

11 In addition, there is provided in a fifteenth aspect 

12 of the invention a polypeptide encoded by a 

13 construct encoding a fragment of a fluorescent 

14 protein, a peptide and a linker portion of at least 

15 15 amino acid residues interposed between said 

16 fragment and said peptide, wherein- said fragment, of - 

17 fluorescent protein is capable of functional 

18 association with a complementary fragment cf 

19 fluorescent protein such that on functional 

20 association of said fragments fluorescence is 

21 enabled . 
22 

23 In a sixteenth aspect of the invention there- is 

24 provided a library of polypeptides as encoded by 

25 constructs according to the fifteenth aspect of the 
2S invention. 

27 

28 Preferably the polypeptides of the library comprise 

29 a linker portion which encodes between 15 to 100 

30 amino acid residues. 
31 
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1 Preferably the polypeptides of the library comprise 

2 a linker of substantially fcydrophillic amino- acid 

3 residues. 
4 

5 More preferably the linker is comprised of multiples 

6 of a pentapeptide sequence such as glycyl-glycyl- 

7 glycyl -glycyl - serine . 
8 

9 More preferably the linker portion is greater than 

10 20 amino acids, more preferably greater than 25 

11 amino acids, more preferably greater than 30 amino 

12 acids, more preferably greater than 35 amino acids, 

13 even more preferably greater than 4 0 amino acids, 4 

14 even more preferably greater than 50 amino acids and 

15 yet more preferably greater than 55 amino acids in 

16 length.. - - . 
17 

18 Preferably, the linlter comprises up to 50 amino v 

19 acids. . 
20 

21 Preferably the first and / or second linker of the 

22 vector, or polypeptide can comprises between 15 to 

23 100- amino acid residues, 
24 

25 Preferably the first and / or second linker is * 

26 comprised of substantially hydrophillic amino-acid 

27 residues- " ~ ~ " 
28 

29 More preferably the first and / or second linker is 

3 0 comprised of multiples of a pentapeptide sequence 

31 such as glycyl-glycyl-glycyl-glycyl-serine. 
32 
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1 More preferably the first and / or second linker 

2 portion, encodes or comprises greater than 2 0 amino 

3 acids, more preferably greater than 25 amino acids, 

4 more preferably greater than -30 amino acids, more 

5 preferably greater than 3 5 amino acids r even more 

6 preferably greater than 40 amino acids, even more 

7 preferably greater than SO amino acids and yet more 

8 preferably greater than 55 amino acids in length. 

9 

10 According to a seventeenth aspect of the present 

11 invention there is provided a kit comprising at 

12 least one pair of constructs according to the first 

13 aspect of the invention and means to express the 

14 constructs. 
15 

16 The. kit. may further.. include, test agents, which may - 

17 enhance or inhibit peptide to peptide interaction. 
18 

13 In another embodiment the kit includes cell lines in 

20 which the vector of the third aspect can be 

21 expressed. 
22 

23 Alternatively the 1 kit can comprise' at least one" 

24 polypeptide of the fifth aspect of the invention and 

25 means for introducing the polypeptide into a cell. 
26 

27 Additionally, the kit can include instructions for 

28 using the kifc_to_ practise the present invention. 

29 The instructions should be in writing in a tangible 

30 form or stored in an "electronically retrievable 

31 form. 
32 
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Brief description of the figures 

The present invention will now be described with 
reference to the following non-limiting examples and 
with reference to the figures, wherein: 

Figure la is a ribbon diagram of EGFP annotated 
with split point sites; 

Figure lb is an illustration of the split 
points and the related sequences surrounding 
these split points of EGFP; 

Figure 2 is a representation of a hapto-EGFP It 
with a 26 residue linker between the 

f luorogenic fragments and the bait and prey »s8i 
proteins respectively; - x 

Figure 3 is a graph of the fluorescence ~ 
produced by the association of fragments joined ■ ■ 
to lingers of different lengths, (A) Cells 
cotransfected with pN157(e?)zip and pzip{4)C158 
•in which a functional leucine sippers mediate 
the association of haptoEGFPl-157 and 
haptoEGFP158-238 to generate fluorescence, (B) 
Negative control cotransf ection using pNl57(6") 
and p(4)C15S which lack sequences encoding the 
leucine zippers and as such fail to generate 
fluorescence, (D) Cells cotransfected with 
pN172(S)zip and pzip<4)C173 in which a 
functional leucine zipper mediated association 
of haptoEGFPl-172 and haptoEGFP173-238 occurs 



1 to generate fluorescence which is of greater 

2 intensity to that observed with the 157/158 

3 split point (E) Negative control 

4 cotransfection using pN172{£) arxd p(4)C173 

5 which lack sequences encoding the leucine 

6 zippers and as such fail to generate 

7 fluorescence, (C and'F) Confocal images of 

8 cotransfected cells from (A) and (D) showing 

9 the intracellular localisation of f liiorescence ... 

10 Vero cells were cotransf ected with plasmids 

11 encoding linkers ranging in length from 4 to 26 

12 amino acids and UV images were collected at 24 

13 hours post-hransfection using identical 

14 exposure times, <G) pN157 (6") zip and 

15 pzip(4)C158 (H) pN157 <ltf ) zip and pzip (14) C158 
_^16 ■ pN157(2S}-zip and (24) G158 (J) - -* ~ 

17 pN157 (26) 2ip and pzip{4}C158 (K) pN157 ( €) sip 

IB and pzip (24) C158 (L) a negative untransf ected 

19 control illustrates the background fluorescence 

20 level, Xtalici&ed figures in brackets indicate 

21 the length of the hydrophilic linker; 
22 

23 Figure 4 shows the ' importance of being able to 

24 fuse the interacting peptide to either the N, 

25 N' , C or C of the fluorescent fragment. 
26 

27 Structural studies of GPP have revealed that the 

28 protein exists as a compact cylindrical structure , 

29 with eleven beta-sheet strands forming the walls of 

30 the cylinder, the K and C termini being at close 

31 proximity at the base of the structure. Sections of 

32 alpha-helix form caps on the end of the cylinders 
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and an irregular alpha-helical segment also provides 
a scaffold for the fluorophore which is located in 
the geometric center of the cylinder. This folding 
motif, with beta-sheet outside and helix inside is 
known as beta-can. 

The inventors have shown that fluorescence can be 
generated following functional association of two 

^ B rJP ar .^ e . ?JT^ m -^ n ^ 3 ..°? molecules, ^(haptpGFPs), . when 

driven by the interactions of a pair of proteins 
fused both to the new C 1 and N 1 termini of each 
haptoGFP and also to the existing termini. 

Functional association of fragments' of fluorescent 
proteins, brought together by the interaction of * 
.peptides fused to the fragments, to screen for 
protein to protein interactions requires that the 

fragments reliably functionally associate only after r 

interaction of the fused peptides. 

Reliable functional association has to date not been 
obtainable due to the possibility of sterio 
hindrance' and steric constraints on the functional 
association of haptoGFPs when bulky proteins are 
associated to the GFP fragments. 

To overcome problems of steric hindrance , linker 
regions of at least 15 jresidues are provided between 
the peptide being tested for interaction and the 
associated f luorogenic fragment. This provides the 
peptide with considerable flexibility relative to 
the f luorogenic fragment to bind to another peptide 
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1 being tested while still enabling the f luorogenic 

2 fragments to complement each other and cause 

3 detectable fluorescence to be generated. 
4 

5 To prepare GFP fragments, which are capable of 

G functional association, split points were generated 

7 at various points along the 239 residue length, of 

8 the GPP protein, resulting in the generation of new 

9 C' and N* termini >rhich, in .three.. dimensions,, are 

10 located at the top and at the base of the beta-can 

11 structure . 
12 

13 Split points were introduced based on a structure 

14 driven approach between hydrophilic residues. The 

15 eleven strands of the beta structure making up the 
16- - beta-can topology of EGFP* are characterised by - 

17 forming three instances of a tripartite antiparaiiel 

IS sheet motif joined edge to edge around the periphery 

19 of the 'can 1 , with the remaining two beta strands 

20 completing 'the cylindrical structure. *Tiie most ■ 

21 successful split points obtained to date occur in 

22 the third tripartite motif between hydrophilic 

23 residues allowing adjacent hydrophobic side chains 

24 to promote refolding of the haptoGPPs * 
25 

26 As shown in the non exhaustive list of Table 1 a 

2 7 number of split points were identified using the 

28 above approach. It would appear „that .each split 

29 point in Table 1 is simply one example of a range of 

3 0 potentially useful split points , the range being 
31 shown in parentheses of Table 1. 

32 
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Table 1 



Split point 
Number 


Residue 
position in 
EGPP 


Possible 
range 


1 


23/24 


(23-25) 


2 


38/39 


(36-41) 


3 


50/51 


(48-54) 


4 


76/77 


(75-91) 


..5 ^ _. r , 


„ 8.9/90. ... 


(75-rSO). . .... . 


e 


102/103 


(101-103) 


7 


116/117 


(115-118) 


8 


132/133 


(129-143) 


9 


142/143 


(129-143) 


10 


157/158 


(155-160) 


11 


172/173 


(171-175) 


12 


190/191 


(187-199) 


13 


211/212 


(207-218) 


14 


214/215 


(207-218) 



To extend the versatility of the hapto-EGFP method, 
constructs were created where instead of using C 
and N l for the attachment of heterologus proteins, 
the endogenous termini, N or C together with one of 
the new N' or C termini were used. Using this 
technique the bait and prey peptides can be added., 
such that they are orientated to the associated 
fluorogenic fragments in the same direction as' each 
other, for example both attached to bottom of the p- 
can structure of GFP or in the opposite direction, 
for example the bait peptide is attached to the 
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1 bottom of the (3-can structure of GFP, while the prey 

2 protein is attached to the top of the fi-can 

3 structure of GFP, As will be understood by those . . 

4 skilled in the art, and as shown in figures 4 A & B, 

5 as peptides interact with each other in a particular 

6 orientation, then the direction of the linkage of 

7 the peptide to the N, N' , C or C end of the 

8 f luorogenic fragment becomes important in certain 

-9 - *• circumstances so as -to allow the fluorescent 'protein * 

10 fragments to functionally interact following 

11 interaction of the peptides * 
12 

13 Thus, to minimise interference with the refolding 

14 and association of the two hapto-EGFPs during 

IB assembly, it would appear that the most versatile 

16 split points may occur at the bottom of the (3- can. 

17 These effects may be minimised by the use of longer 

18 linkers to accommodate adverse topology (Figure 4C) . 
19 

20 Example 1 
21 

22 As^ shown_ in _ figures 2 and 3, hapto-EGFP. with a ,26-. .... 

23 residue linker between the fluorogenic fragments and 

24 the bait and prey proteins respectively were 

25 produced without loss of fluorescence. These 

26 linkers may jjts ^±i«xxctu. using overlapping 

27 oligonucleotides encoding repeating (GC3GGS) x units, 

28 -This was achieved by using unique Sac I and Bafrikl" 

29 restriction sites present in the core. expression 
3 0 vectors pN EGFP (Sac) zip and pzip (Bam) d***. 

31 
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1 
2 
3 

4 

5 

6 

7 



10 
11 



17 
IS 
19 
20 



To test whether it was possible to obtain 
fluorescence when endogenous N or C termini and a 
new N ' or C terminus are used to attach heterologus 
proteins the fusion (F) and haemagglutinin (H) 
membrane proteins of measles virus (MV) were used. 



Measles virus {MV) infection is mediated by a 

8 complex of two viral envelope proteins, 

9 haemagglutinin (H)_ glycoprotein and fusion, (F) ... . 
glycoprotein that bind to each other and then 
complex with surface receptors to aid the fusion of 

12 the virus with the plasma membrane. The H 

13 glycoprotein is dimerised in the endoplasmic 

14 reticulum and is thought to exist on the cell \ 

15 surface as a tetramer (dimer of dimers) . The fusion i 

16 (F) glycoprotein; is synthesised as an inactive 4 
precursor (F 0 ) which is a highly conserved type I } 
transmembrane glycoprotein of about SOkDa, which is % 
cleaved by fur in in the trans -golgi to yield the " : 
41kDa (fx) and the iskDa (f 2 ) disulphide- linked 

21 activated F-protein. Infection of the measles virus 

22 is dependent on the interaction of the F/H complex 

23 • with cell surface receptors. 
24 

25 A pair of constructs was generated which encoded the 

26 H glycoprotein fused at its N terminus to either the 

27 C terminal residue of hapto-EGFP 1 - 157 (n 1/1S7 ) in. the 

28 , .. first member > of to the natural C terminal residue 

29 of the complimentary hapto-EGFP 158 - 239 (C158/239) in 

30 the second member of the pair. Each construct 

31 includes an encoded linker between these two 

32 proteins. 
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1 Linkers generated using overlapping oligonucleotides 

2 which contain Sfi XA and Sfi IB restriction sites 

3 were introduced into pN 1/1S7 (16) zip and pC X58/33S (14) zip 

4 constructs- The complete H gene ORF was amplified 

5 by polymerase chain reaction (PCR) using primers 

6 containing Sfi IA and Sfi IB restriction sites and 

7 the PCR products used to generate pN 1/157 (16) MV-H and 

8 pC 158 ^ 239 (14) MV-H. A similar pair of constructs 

9 employing. N .and JKT .hapto-EGFP. fusions r pM^T-F C!6 ) ftV?- 57 . . • ------ 

10 and pMV-F (14) G 158/23a could be generated from existing 

11 clones using a similar strategy. 
12 

13 Vero cells (African green monkey kidney -derived cell 

14 line) were transiently transfected with pN 1/157 (16) MV- 

15 H and pMV-F (14) c 15 *' 238 constructs, the proteins 

- 16 expressed and phase contrast microscopy used to 

17 determine whether the modified glycoproteins 

.13 retained their f usogenicity , 
IS 

20 Real-time observation by ultraviolet and confocal 

21 microscopy indicated if fluorescence was generated 

22 upon expression of the hapto-EGFP/glycoprotein 
.23 *. fusions . 

24 

25 As cells transiently t ran s f ected with both F and H 

26 expression plasmids form syncytia in the absence of 

27 viral replication, the formation of syncytia can be 

28 used to assay fp£. successful transection of both 

29 plasmids. 
30 

31 The size of the syncytia was compared with controls 

32 to establish whether transfection had occurred. UV 
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1 

2 

3 
4 
5 
6 



11 



14 
15 
16 

IT 



and confocal microscopy were used to examine the 
fluorescence so as to verify that association 
between H protein oligomers and P proteins had taken 
place. Confocal microscopy and image reconstruction 
were also used to determine the intracellular 
localisation of H protein oligomers during formation 



7 of the fusion complex 
8 

9 

10 



Using the .above . vectors the intracellular - . . -, . ... . 
association of P and H proteins and their 
trafficking from the endoplasmic reticulum (ER) to 

12 the plasma membrane was tracked. Further, membrane " 

13 receptor proteins which interact with the H protein T 
could be identified as could cytoplasmic proteins * 
which interact with known MV receptors and which may * 
therefore initiate downstream signalling events. '~f 



Tne- above constructs could also be incorporated into 
a recombinant measles viral genome and the 
experiments repeated to determine if the above 
constructs could be used in In vivo viral studies, 



13 Sample 

IS 

20 
21 
22 
23 
24 

2 5 The type- 1 P glycoprotein is proposed to form 

26 trimers . 

27 
28 
29 
30 
31 



in this example two split points could i>e introduced 
into the EGFPr The constructs pMV^F (xe) N 1 ' 157 , pM V- 
P(14m xsB/i*o ^ pMV . p(14)Gl9 x/ 239 CQuld be generatedL 



'46 

1 The method could then be adapted to screen for and 

2 identify virus receptors. 
3 

4 This ' could be tested with MV and applied to the 

5 closely related mumps virus (MuV) . 
6 

7 Example 3 
8 

9 Fusion of oligonucleotides encoding haptfo-EGf.P^ _ 

10 sequences to members of a cDNA library. 
11 

12 Firstly, the sequence encoding the hapto-EGFP may be 

13 fused to the 5' end of the library due to the 

14 presence of downstream stop codons in the cDNA. 
15 

16 JS.ecoxidly^v.Gonsiructs .are- required to -be generated 

17 for all three reading frames to ensure that one is 

15 in the correct reading fraa»e. - 
19 

20 Thirdly, the cDNA sequences are required to" be • * 

21 obtained from a source which permits directional 

22 cloning into restriction sites which are extremely . 

23 rare in mammalian DNA , Such sequences are to be 

24 found in the Jkarge-Xnsert cDNA library (Clontech) , 
25 

26 A core panning vector could be engineered from 

27 existing plasmids to contain a CMV promoter, an 

2 8 initiation codon and sequences encoding a haptp-JEGER 

29 and "an intervening linker, an Sfi IA site and an Sfl 

30 IB site, a stop codon and an SV40 polyadenylation 

31 signal. Two additional screening vectors could be 

32 generated to include one and two extra nucleotides 
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13 
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16 
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21 
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1 between the linker and the Sfi IA site to correct 

2 the reading frame. cDNA fragments, flanked with Sfi 
IA and Sfi IB sites obtained from the library could 
be cloned downstream of the optimised hapto-EGFP 
linker constructs. The hapto-EGPP library could then 
be transfected into CHO cells and a mixed population 
of cells selected using G418 and passaged to 
confluency. These cells could then be transfected 
with CD46 r haptoEGFP or the equivalent .SLAM, plasmld . . 



11 Where interaction between the peptides being 

12 screened occurs, fluorescence is generated. 



Any cells which fluoresce can then be isolated by 

15 fluorescent laser microdissection and single cell ; : 
RT-PCR performed to identify raRNA which encodes 

peptides which interact with the cytoplasmic tails \ 



18 of the receptor molecules. 
13 
20 



Although the invention has been particularly " shown 
and described with reference to particular examples, 

22 it will be understood by those skilled in the art 

23 that various changes in the form and details may be 

24 made therein without departing from the scope of the 

25 present invention. 
26 
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Figure I 
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GBP fold: The ribbon diagram to the left is coloured similarly to the cartoon on the right, p-sheets are indicated by 
arrows, cc-helices by cylinders- Numbers within these symbols refer to sequence positions in EGFP (numbered 
according to the crystallagraphic structure - accession: 1 emb) r In the cartoon, connecting loops are shown by lines. 
Potential split points are starred and numbered sequentially from the N-termrnus, (see B below for precise 
defmitians). The fluorophore is represented by a green circle. 
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Possible split points in EGFP considered for h&pto formatioxL 



No. 


Position 


Top/ 
Bottom 


> - - 






Sequence- - 






1 


23/24 




T 1 




. JD V N 




K 










2 




- . -38/39 




B 










— G 






3 


50/51 




T I 




~SE C T 




K 










4 




76/77 




B 










-JR. 


y P D*H 2* K 


Q~ 


5 




89/90 




B 












W E g y 7„ 


6 


102/103 




T 1 




»*P J?" K 




G 










7 




116/117 




B 










■» JC 


F B T Ii 




8 


132/133 




T 1 




_D F K 


G 


X_ 








9 


142/143 




T 






. — H K h 




N 


y n_ 








10 




157/158 






B 












D K Q 4 ** JKT G 


x_- 


11 


: 172/173 




T 






it x 




G 


S 








n 




. 190/191 






B 












X G D^G P V 




13 


211/212 








«s K~ D 


E 










14 


.214/215 








-J? N E 




D 




ti — 1 







Adjacent hydrophobic residues are in italics. 

The colours in the vertical bar correspond to the stmctnral motifs of the cartoon. 
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Figure 2 



Library search and proximity 
measurement 



I. Prey Library 




Ha fa> EGFPs 



IL Single 'BAIT multiple 
linker library 



Schematic for protein to protein interaction searches by library mterrogatioH. The two 
proteins in question are designated 'Bait* and 'W 1 . Two libraries are generated (I and II), one 
series of constructs (here designated T. . „Z, library I, >10,000 members) encodes a hapto- 
EGFP followed by a DNA sequence encoding a 60 residue linker attached to the 5 f -end of a 
• cDNA library, which contains the gene encoding the 'prey', "W" here. The second series of 
constructs (a...e here, library II, <20 members) encodes the complementary hapto-EGFP 
followed by a degenerate linker DNA sequence and the 'bait 1 gene. The individual 
components of the system axe colour coded: blue - 'BaitVPrey*; pink - Linker; green - hapto- 
EGFP. All arrows indicate the direction of the polypeptide backbone (N->C) 

A* 'Prey 1 identification: co-transfection with the 'prey* library (T) and construct V (long 
linker - preferably 60 amino acid residues) from the 'bait 1 library (H) will generate 
fluorescent cells when the recipient cell receives a vector from library (I) bearing the W 
gene (in this case) and a second vector bearing the V bait construct Clonal expansion of 
these fluorescent cells allows identification of gene W\ 

B- Prqxfanity measurement: The clQne(s) frpm step : A.are co-transfected with_the 'baif, . 
library (H)> In this case cells showing fluorescence synthesise interacting proteins with a 
sufficiently long linker to allow productive complementary hapto-GFP interaction, (M 1 or V 
in this case), as shown to the left of the diagram- The hollow blue airows in the right hand 
part of the diagram are intended to indicate that the interaction of the gene products with 
these two constructs will generate fluorescence, while other interactions between the product 
of gene 'W and the bait protein will not give rise to fluorescent cells due to insufficient 
length of linker. 
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Figure 3 




Fluorescent images of Vero cells transiently cotransfected with haptoEGFP expression 
constructs: 
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Figure 4 
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